Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
OpenSource
Size:
7.4 GByte
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
- Paper track: 5.8 Source separation and computational auditory s/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gene-Ping Yang | DEMAND | /N |
Documentation:
https://zenodo.org/record/1227121#.XJ7UnC33VQI
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
8.8 GByte
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
- Paper track: 5.8 Source separation and computational auditory s/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gene-Ping Yang | WSJ0 | /N |
Documentation:
https://catalog.ldc.upenn.edu/LDC93S6B
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None
Production Status:
Existing-used
Use:
ASR rich transcription
- Paper title: Leveraging a character, word and prosody triplet for an ASR error robust and agglutination friendly punctuation approach
- Paper track: 10.5 Rich transcription/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | György Szaszák | IWSLT 2011 Speech Translation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
1000 hours
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: GPU-based WFST Decoding with Extra Large Language Model
- Paper track: 9.5 Search methods, decoding algorithms, lattices,/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daisuke Fukunaga | LibriSpeech | /N |
Documentation:
https://www.danielpovey.com/files/2015_icassp_librispeech.pdf
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
21108664 KByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Completely Unsupervised Phoneme Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
- Paper track: 10.7 New paradigms (e.g. artic. models, silent spe/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kuan-yu Chen | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
Yes; english; Yes
Speech
Evaluation Data,
Language Type:
Multilingual
Languages:
English French Mandarin Chinese
Availability:
From Owner
License:
Size:
4217 minutes
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Speaker Adversarial Training of DPGMM-based Feature Extractor for Zero-Resource Languages
- Paper track: 10.8 Zero-resource speech recognition/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yosuke Higuchi | ZeroSpeech 2017 | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile
- Paper track: 3.3 Automatic analysis of speaker states/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chi-Chun Lee | IEMOCAP | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
English French
Availability:
Freely Available
License:
Size:
33192 sentences
Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings
- Paper track: 12.19 Other topics in Spoken Language Processing: /Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marcely Zanon Boito | English-French Parallel Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
I2R
Size:
20 GByte
Production Status:
Existing-used
Use:
Person Identification
- Paper title: A Unified Framework for Speaker and Utterance Verification
- Paper track: 4.3 Speaker verification and identification/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Maulik Madhavi | RSR2015 database | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English Spanish
Availability:
From Owner
License:
LDC
Size:
40 MByte
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Direct speech-to-speech translation with a sequence-to-sequence model
- Paper track: 12.2 Speech-to-speech translation systems/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ye Jia | Fisher and CALLHOME Spanish-English Speech Translation | /N |
Documentation:
None




